...jL, M,: }j ""ir "• if" .„iL, 3i„ }i m jj' ..u.^ iuii 

SPECIFICATION 

Electronic Version 1.2.8 
Stylesheet Version 1 .0 

[INPUT BUFFER MANAGEMENT 
FOR THE PLAYBACK CONTROL 

FOR MPs PLAYERS] 

Background of Invention 

[0001 ] 1 . Field of the Invention 

[0002] The present invention relates to input buffer management of the playback control 
for MP3 (Motion Pictures Experts Group Layer III Audio) players. More specifically, a 
method that initially stores incomplete main.data from MP3 frames in the input buffer 
until enough main.data is available to correctly decode a frame, allowing the decoding 
of an MPS file beginning at an entry point in the MP3 file other than the start of the 
MP3 file is disclosed. 

[0003] 2. Description of the Prior Art 

[0004] MPEG-1 audio layer III (MP3) is an ISO/IEC (International Organization for 

Standardization and International Electrotechnical Commission) standard of audio 
coding with high quality as well as high efficiency. 

[0005] Fig. 1 is a block diagram of a traditional MP3 player 10 for reading MP3 files from 
medium sources such as an optical disk. The MP3 player 1 0 has a parser 1 01 , an input 
buffer 102, and an MP3 decoder 103. The MP3 decoder 103 decodes the frames 
moved from the medium source to the input buffer 1 02 by the parser. 

[0006] 

Referring to Fig. 2, an MP3 file contains a number of frames. Each frame is a unit of 
data that is enough for decoding and generating a certain amount of audio samples. 
Take frame 7 for example. The frame 7 includes a header 71 , an error check 72, a 
sample data 73, and an external data 74. The header 71 is a 32-bit stream including a 
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12-bit synchronization word. In addition, the header 71 has 2 bits for sampling 
frequency, 4 bits for bitrate, and other Information. The error check 72 is an optional 
1 6-blt stream for error detection. Sample data 73 includes necessary information for 
decoding audio samples of this frame. External data 74 includes data other than audio 
information. Sample data 73 includes main^data_begin 731, audio_data 732, and 
main.data 733. The main_data_begin 731 will be described later. The audio.data 732 
contains parametric Information about the decoding algorithm and main.data 733 
contains data of the encoded audio samples. The size of audlo.data 732 and 
main.data 733 may vary In different frames. 

[0007] Not all kinds of audio information require the same amount of space after 

compression. For example, a certain time duration of silence requires less storage 
space than the same time duration of complex orchestra music when being 
compressed. Therefore when encoding a bit stream of MP3, the MP3 standard allows 
an overflow of the main data 733 from one frame into the unused space of prior 
frames so as to fully utilize the bandwidth. 

[0008] For example, the size of the main.data 733 of a certain frame is 1 00 bytes and 

the required size of the audio samples is 203 bytes. The 1 03 bytes of overflow will be 
allocated to a main_data of prior frames, such as a main.data 633 of a frame 6. If the 
size of the main.data 633 is still not enough for storage of the 103 bytes, a main.data 
533 of a frame 5 will be used. The main_data^begin 731 will have a value of 1 03 
Indicating the size of the overflow is 103 bytes. If the size of the main.data 633 is 
only 70 bytes, then there will still be another 33 bytes of overflow. The 33 bytes of 
overflow will again be allocated to the main.data 533 of the frame 5. To avoid an 
error, the MP3 standard requests that the main.data.begin of the first frame of an 
MP3 file, which is frame 1 in Fig. 2, must be zero. 

[0009] 

To correctly decode the frames, the MP3 decoder 1 03 may need the data of 
several frames beforehand when decoding a particular frame. Thus, the MP3 decoder 
1 03 prefers to decode the frames including the first frame of the MP3 file. However, in 
some playback modes, such as AB repeat mode or reverse mode, the parser 101 may 
provide frames which do not start from the first frame of the MP3. In some other 
cases, the MP3 file is damaged and the main.data.begin does not have a correct 
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value. Since the traditional MP3 decoder 103 directly decodes all the frames 
regardless of the content of the main_data_begin of the frame, the decoder 103 
cannot correctly decode the frames and a "bang" noise may occur as the MP3 player 
1 0 starts playing. 

Summary of Invention 

[001 0] It is therefore a primary objective of the claimed invention to provide an input 
buffer management method for the playback control for MP3 players to solve the 
problem of a "bang" noise while allowing the decoding and playing of the first frame 
that can be decoded and played correctly, minimizing lost sound. 

[001 1] Briefly summarized, the preferred embodiment of the claimed invention discloses 
initializing a totalizer to zero, finding a synchronization word of a frame, decoding a 
header, calculating the size of a main^data, and decoding the enclosed audio data. 
Then, a check is performed to determine if the value in the totalizer is greater or equal 
to a value contained in a main_data_begin of this frame. If the totalizer is less than the 
main.data^begin of this frame, add the calculated length of main^data to the totalizer, 
store data from this frame into an input buffer, and repeat the process starting with 
finding the synchronization word of a next downstream frame until the totalizer is 
greater or equal to the main_data_begin of a then current frame. At this point, all of 
the audio data required by the frame is currently available and decoding and playing 
can continue as normal. 

[001 2] It is an advantage that in the input buffer management method of the claimed 
invention that the "bang" noise is avoided and a first frame that can be correctly 
decoded and played is decoded and played regardless of the entry point into the file 
or the direction in which the file Is played. 

[001 3] These and other objectives of the claimed invention will no doubt become obvious 
to those of ordinary skill in the art after reading the following detailed description of 
the preferred embodiment, which is illustrated in the various figures and drawings. 

Brief Description of Drawings 

[0014] Fig.l is a diagram of an MP3 (Motion Pictures Experts Group Layer III Audio) frame 
according to a prior art. 
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[001 5] Fig. 2 is a block diagram of a traditional MP3 player. 

[001 6] Fig. 3 is a flowchart according to one embodiment of the present invention. 

Detailed Description 

[001 7] The present invention allows the correct playing of an MPS file from any frame in 
the MP3 file and in either direction while eliminating the "bang" noise. The present 
method only needs to be invoked when beginning to play an MP3 file and works by 
properly selecting a first frame for which all data associated with the frame to be 
decoded has been read. MP3 file decoding methods are well known in the art and will 
not be discussed here. 

[001 8] Fig. 3 illustrates a flow chart of one embodiment according to the present 

invention. The input buffer 102 associated with the decoding process is flushed so as 
to clear itself of unwanted data. In the preferred embodiment, two variables are used. 
It should be noted that using other quantities or types of variables or accumulators 
such as registers or counters still falls within the spirit of the invention. In this paper, 
the term totalizer is used and is defined here as any device that allows maintenance of 
a running total. 

[001 9] In Fig. 3, a variable W may be of a Boolean type and is used to denote whether or 
not a starting frame has been located so that the current frame can be fully and 
correctly decoded and played. In this embodiment, W is initialized to false, (for 
example, a value of zero). A totalizer A is used as a totalizer to hold a sum of the size 
of main-data fields which has already been placed into the input buffer 1 02. The 
totalizer A is also initialized to zero. After initialization (step 901), the parsing of an 
MP3 bitstream can begin. 

[0020] ^^^^ ^^^p involves searching the bit stream for a first occurrence of a 1 2- 

bit synchronization word. When the synchronization word 71 1 has been located, the 
header 71 of the frame 7 is decoded providing bitrate and other miscellaneous 
information(step 903). The header 71 includes a bit signifying whether or not error 
checking is to be used within the frame. If error checking is used in this MP3 file, the 
error check 72 is read next (step 904), but. as with any MP3 player 10, using the error 
check 72 is merely a design choice and the present invention is not limited to the 
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reading or not reading of this field. Next, the sample data 73 is decoded, the 
main_data_begin 731 is read (step 905). Then the size of the nnain_data 733 Is 
calculated (step 905), 

[0021] At this point, the critical decision in the selection of a starting frame begins. A 
check is made to see if the variable W is still false (for example, equal to zero), 
meaning that as of yet, no starting frame has been found (step 907). If the variable W 
is true (for example, not equal to zero), then a starting frame has been found, no 
further decisions are necessary, and further decoding and outputting proceed as 
normal (step 913). However, if the variable W is still false, it is not known if enough 
data has been read to further process and play this frame. 

[0022] If the variable W is still false, two things about the bit stream are known. One Is 

that no starting frame has yet been found (W = 0) and the other is the amount of data 
that has been already read and placed into the input buffer so far. The totalizer A 
records the amount of data read into the input buffer 1 02. The main_data_begln 731 
of the current frame 7 is used to indicate whether or not the audio samples stored in 
the current main^data 733 are complete. If the audio samples stored are not complete 
and there are still some bytes stored in previous frames, the amount of the overflow 
will be indicated in the main_data_begin 731. For example, if the required sample 
is 203 bytes and the size of the current main^data 733 is 100 bytes, a value of 103 
will be stored in the main_data_begin 731, indicating there are still 103 bytes stored 
in the main_data field of previous frames. Therefore, the MP3 decoder 103 needs to 
locate and decode the preceding bytes in previous frames. The main_data_begin 731 
specifies the location as a negative offset in bytes. A check Is made to see if the 
totalizer A (the amount of data already read into the input buffer 1 02) is greater than 
or equal to the main_data_begin of the current frame. 

[0023] 

While the parser 101 parsed the series of frames to the decoder 1 03, the parser 
101 will also notify the decoder 103 whether to decode the audio samples from the 
beginning of an MP3 file, or from the middle of an MP3 file (step 908). In a first 
circumstance when the MP3 player needs to play the MP3 file from the middle of the 
MP3 file, if the totalizer A is greater than or equal to the main_data_begin 731 of the 
current frame 7, it is guaranteed that enough data has already been read and stored in 
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the input buffer 102 to correctly process and decode the current frame 7. Therefore, 
the variable W is changed to be true (for example, a non-zero value), signifying that 
the bit stream may be correctly processed from then on (step 91 3). 

[0024] On the other hand, if value stored in the totalizer A is less than the value stored in 
the main_data_begin 731 of the current frame 7, not enough data has been read and 
placed into the input buffer to guarantee that the current frame 7 can be processed 
and played correctly. Therefore, the size of the current main.data 733 of this frame 7 
is added to the value contained in the totalizer A and the result is stored in the 
totalizer A (step 91 1). Additionally, the main.data 733 from the frame is stored in the 
input buffer 1 02 for later referencing (step 91 2). 

[0025] Since the starting frame has not yet been found, the decoder 103 should repeat 
the above steps, beginning with locating the synchronization word of the next frame, 
decoding the header, sample data, and others. If the value of the then current frame"s 
main_data_begin is still larger than the totalizer A, the above accumulation process is 
repeated, which means the size of the main^data of the then current frame is added to 
totalizer A and the decoder starts to find a next synchronization word again. The 
process is repeated until the decoder 103 finds a frame in which the main_data_begin 
in the frame is less or equal to the totalizer A. At that point, the decoder 1 03 has 
enough data and it may start processing data and outputting samples. 

[0026] In a second circumstance when the MP3 player 10 needs to play the MP3 file from 
the beginning of the MP3 file, since the normal encoding ensures the main_data_begin 
of the first frame being zero, we can use this as an error detection criteria. In other 
words, if the main_data_begin of the first frame is not zero, the decoder discards this 
frame and search for the synchronization word in the next frame (step 902). The 
decoder 103 repeats this process until it finds the frames with a main«data_begin 
having a value of zero. 

[0027] In contrast to the prior art, the present invention does not have to skip frames 

read to guarantee correct processing and playing of the MP3 bitstream upon startup. 
Neither does the present invention discard frames until a legal frame, one that 
contains all the main_data associated with the frame, is found. Both methods can 
result in a noticeable loss of startup audio and reduced performance. 
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[0028] Those skilled in the art will readily observe that numerous modifications and 

alterations of the device may be made while retaining the teachings of the invention. 
Accordingly, the above disclosure should be construed as limited only by the metes 
and bounds of the appended claims. 
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